Autotuning and Self-Adaptability in Concurrency Libraries

Authors

  • Thomas Karcher
  • Christopher Guckes
  • Walter F. Tichy
Abstract

Thomas Karcher, Institute for Program Structures and Data Organization, Karlsruhe Institute of Technology, 76128 Karlsruhe, Germany, [email protected]

Christopher Guckes, Institute for Program Structures and Data Organization, Karlsruhe Institute of Technology, 76128 Karlsruhe, Germany, [email protected]

Walter F. Tichy, Institute for Program Structures and Data Organization, Karlsruhe Institute of Technology, 76128 Karlsruhe, Germany, [email protected]


Similar resources

Autotuning dense linear algebra libraries on GPUs

As GPUs are quickly evolving in complexity, tuning numerical libraries for them is becoming more challenging. We present an autotuning approach in the area of dense linear algebra (DLA) libraries for GPUs. The MAGMA library is used to demonstrate the techniques and their effect on performance and portability across hardware systems. We show that, figuratively speaking, our autotuning approach f...


Autotuning divide-and-conquer stencil computations

This paper explores autotuning strategies for serial divide-and-conquer stencil computations, comparing the efficacy of traditional “heuristic” autotuning with that of “pruned-exhaustive” autotuning. We present a pruned-exhaustive autotuner called Ztune that searches for optimal divide-and-conquer trees for stencil computations. Ztune uses three pruning properties — space-time equivalence, divi...


Experiences in autotuning matrix multiplication for energy minimization on GPUs

In this paper, we report extensive results and analysis of autotuning the computationally intensive graphics processing units kernel for dense matrix–matrix multiplication in double precision. In contrast to traditional autotuning and/or optimization for runtime performance only, we also take the energy efficiency into account. For kernels achieving equal performance, we show significant differ...


Autotuning of Pattern Runtimes for Accelerated Parallel Systems

Parallel architectures with node-level accelerators promise significant performance improvements over conventional homogeneous systems. To cope with the increased complexity of programming such systems various pattern-based programming libraries have become available. In this paper we present our work on providing autotuning capabilities for two runtime libraries that provide parallel programmi...


An Evaluation of Autotuning Techniques for the Compiler Optimization Problems

The diversity of today’s architectures has forced programmers and compiler researchers to port their applications across many different platforms. Compiler auto-tuning plays a major role in that process, because the standard optimization levels, tuned for average-case performance, often fail to deliver the best results. To address the problem, different op...



Journal title:
  • CoRR

Volume abs/1405.2918

Publication date 2014